High performance gain cell architecture

ABSTRACT

A memory architecture that utilizes single-ended dual-port destructive write memory cells and a local write-back buffer is described. Each cell has separate read and write ports that make it possible to read-out data from cells on one wordline in the array, and subsequently write-back to those cells while simultaneously reading-out the cell on another wordline in the array. By implementing an array of sense amplifiers such that one amplifier is coupled to each read bitline, and a latch receiving the result of the sensed data and delivering this data to the write data lines, it is possible to ‘pipeline’ the read-out and write-back phases of the read cycle. This allows for a write-back phase from one cycle to occur simultaneously with the read-out phase of another cycle. By extending the operation of the latch to accept data either from the sense amplifier, or from the memory data inputs, modified by the column address and masking bits, it is also possible to pipeline the read-out and the modify-write-back phases of a write cycle, allowing them to occur simultaneously. The architecture preferably employs a nondestructive read memory cell such as 2T or 3T gain cells, achieving an SRAM-like cycle and access times with a smaller and more SER immune memory cell.

BACKGROUND OF INVENTION

[0001] This invention generally relates to an embedded dynamic random access memory (embedded DRAM), and more particularly, to a high-performance DRAM architecture utilizing gain cells.

[0002] Memory arrays constructed of six transistor static memory cells (6T SRAMs) are generally known to have faster access time and cycle time when compared to single-ended dynamic cells.

[0003]FIG. 1A shows a transistor level schematic of a conventional 6T SRAM cell 10 consisting of four NMOS transistors 1, 2, 5, and 6, and two PMOS transistors 3 and 4. PMOS 3 and 4 and NMOS 5 and 6 form a CMOS cross-coupled latch, which maintains a data bit as a storage element. NMOS transistors 1 and 2 couple nodes 7 and 8 to biltlines BL1 and BL2 when activated by wordline WL, allowing the data bit to read to or written from BL1 and BL2. On the other hand, single-ended dynamic cells are known to be smaller and benefit from significantly reduced soft-error-rate at small geometries.

[0004]FIG. 2A is a transistor level schematic of a conventional single-ended dynamic cell 20. It consists of one NMOS transistor 21 and capacitor 22 (1T DRAM cell). When the wordline WL is activated, NMOS 21 couples capacitor 22 to the bitline BL, allowing the data bit stored in capacitor 22 to be read to or written from BL.

[0005] Several reasons exist to explain the difference in performance. From a functional standpoint, the SRAM cell 10 shown in FIG. 1B can be described as one having three signal connections attached to each cell consisting of one storage element and two switches 11 and 12. These are: wordline (WL) and bitlines 1 (BL1) and 2 (BL2). Cells are arranged in a matrix formation with a wordline connecting a plurality of cells in one direction and bitlines in an orthogonal arrangement. Switches 11 and 12 are controlled by wordline WL such that storage element 15 can be accessed by bitlines BL1 and BL2. Data is preserved in the SRAM cell for many cycles as long as power is maintained, and as long as the wordline servicing the cells is not activated. In a non-activated condition, because switches 11 and 12 are opened, the SRAM cell presents a high impedance to the BL1 and BL2 connections. When the wordline is activated, switches 11 and 12 couple storage element 15 to bitlines BL1 and BL2. As a result, the SRAM cell displays a different impedance to bitlines BL1 and BL2 depending upon the state of the memory cell. For a cell storing a logic ‘1’, BL1 displays a lower impedance than BL2 at an impedance value that is slightly less than that in the non-activated case. A logic “1” represents the condition necessary for the CMOS cross-coupled latch in FIG. 1A to maintain a “0” and “1” at nodes 7 and 8, respectively.

[0006] Assuming that BL1 and BL2 are precharged to VDD, the impedance of NMOS switch 1 in FIG. 1A is lower than that of NMOS switch 2. For a cell storing a ‘0’, bitline_2 port will have a lower impedance than the bitline_1 port which, in turn, will display an impedance that is slightly less than the non-activated case. Thus, a “0” is the state wherein the CMOS cross-coupled latch maintains a 1 and 0 at nodes 7 and 8, respectively. Assuming having BL1 and BL2 precharged to VDD, then, NMOS switch 2 impedance is lower than the impedance of NMOS switch 1. This difference in impedances can be used to sense the state of the memory cell using one of several existing techniques. The state of the static memory cell is not disturbed by reading it; hence it is known to have a non-destructive read-out (NDRO). The cell is written by asserting the wordline while forcing a reference voltage on bitline_1 to write one state, or by forcing a reference voltage on bitline_2 to write the other state. If no bitline is forced (or if both are forced to the same direction), the cell will not be written, but instead will maintain its previous state. This is known as a non-destructive write (NDW).

[0007] The aforementioned characteristics are used to enhance performance using the following techniques since the read-out operation is nondestructive and the bitlines may be precharged back to their ‘ready’ state while the wordline is still activate. This characteristic makes it possible to reduce the time of a random access cycle. When a read-out is nondestructive, it is possible to multiplex several bitlines into a single sense amplifier, allowing the use of a larger area for each sense amplifier, and hence more complex, higher performance circuits. Moreover, since bitlines produce differential signals from each cell, the bitline signal is ‘self-referenced’, essentially reducing noise, improving the signal/noise ratio, and allowing a faster amplification of the signal. Furthermore, since the SRAM cell develops an impedance difference when read, it is possible to use current sensing techniques to sense the signal. These are known to be faster than voltage sensing techniques (traditionally used to sense dynamic cells) particularly for arrays having many bits per bitline. Since the SRAM cell can be read nondestructively (NDRO), the cell need not be written-back after it is read-out (as compared to traditional dynamic cells). Also, because the SRAM cell enables a nondestructive write (NDW), it is possible to perform a write operation only for selected cells even if more cells are activated by the corresponding wordline. Unselected data bits are typically maintained by floating bitlines (or by forcing both bitlines to high). Accordingly, the non-destructive read (NDRO) and non-destructive write (NDW) features have the most significant impact on the cycle time of a conventional DRAM cell array when compared to an SRAM cell.

[0008] From a functional standpoint, the single ended dynamic cell 20 can be described as one having two signal connections attached to each cell including one storage element 22 and one switch 21 (FIG. 2B). The connections are: wordline (WL) and bitline (BL). Because of the nature of the dynamic cell, when WL is activated, data in the storage element is destroyed (destructive read). The destroyed data needs then to be written back into the storage element increasing the read cycle time. Because of the write back requirement, DRAMs typically use a CMOS cross-coupled sense amplifier SA. Additionally, the write cycle time is equal or slower than the read cycle time, unless all the bits coupled to the wordline are simultaneously written. This may be explained by the fact that unselected cells for the write operation are destroyed in a manner similar to the destructive read operation, requiring a sensing and write back operation (read modified write), necessitating a longer write cycle time.

[0009] Accordingly, it is important to solve these two problems to enable the SRAM like cycle time. Techniques are known to utilize a destructive read/destructive write 1T1C DRAM cell in combination with a write-back buffer array to realize a dynamic memory with cycle time that approaches that of an SRAM. One known technique is a memory architecture in which the read-out of one DRAM array occurs simultaneously with the write-back operation in a separate DRAM array of the same memory. A buffer array is used to resolve data conflicts. Data management techniques inherent to this solution, however, tend to increase the access time. Hence it is difficult to use such a technique to achieve an SRAM-like access time.

[0010]FIG. 3A shows a prior art single-ended multi-port destructive write memory cell 30 consisting of a storage element 33 and two switches (31 and 32). The single ended multi port destructive write memory cell is characterized by having four wires connected to each cell:

[0011] a read bitline (RBL) traversing the array in one direction,

[0012] a write bitline (WBL) traversing the array in a direction parallel to the read bitline;

[0013] a read wordline (RWL) traversing the array of cells in a direction orthogonal to the bitlines;

[0014] and a write wordline (WWL) traversing the array of cells in a direction parallel to the read wordline.

[0015] Data is stored in the cell for as many cycles as necessary as long as the read wordline and write wordline are not activated. The cell displays high impedance at its read bitline port as long as the read wordline is not activated. When the read wordline RWL is activated, the switch 30 couples storage element 33 to read bitline RBL. This enables the non-destructive read cell to provide an impedance to the read bitline (RBL) which depends on the logic value stored in a cell similar to the 6T SRAM cell. This occurs in the class of single-ended multi-port cells which have nondestructive read-out.

[0016]FIGS. 3B-3C show conventional nondestructive read and destructive write memory cells comprising three transistors (3T cell) and two transistors (2T cell), respectively, or a destructive read cell such as one having a single capacitor, wherein the storage element shares the charge that it is stored in the capacitor in the read bitline (RBL). This occurs in the class of single-ended multi-port cells which have destructive read-out.

[0017]FIG. 3D shows a prior art destructive read and destructive write memory cell consisting of two transistors and one capacitor (2T 1C cell).

[0018] Regardless whether a 3T, 2T, or 2T 1C cells are used, when a write wordline is activated, the cell takes on the logic state that is forced onto the write bitline by opening switch 32 (FIG. 3A), coupling it to the write BL (WBL). Every cell which is connected by a write wordline being activated is written into. There is no option for activating a wordline and having the cell maintain its previous state because of a destructive write. Therefore, it is required to write all the bits coupling to the activated WWL simultaneously. Otherwise, the write cycle time is limited by the read modify write back operation similar to the DRAM. However, writing all the bits along the wordline concurrently requires rearranging the data lines and associated drivers with the same periodicity as the memory cells. Such a solution requires significant area and hence is expensive. Also, the number of cells coupled to a wordline must be equal or less than the number of the write data bits. This requirement is not practical for a “narrow” I/O organization, such as ×16, or even for ×32 or ×64. Using a single-ended multi-port non-destructive write memory cell may overcome the problem. However, it requires at least an equal number, or preferably, more transistors than the conventional 6T SRAM cell and is, therefore, undesirable for being expensive.

[0019] In a typical destructive read array, the read cycle consists of a read-out, a write-back and a precharged phase. In the conventional destructive write array, all the data bits coupled to a wordline are simultaneously written. Otherwise, the write cycle consists of a read-out, a modify-write-back and a precharged phase (read modified write). Hence the typical operation of a single-ended destructive write multi-port memory cell ends up having a longer cycle than that of a nondestructive read/write 6T SRAM.

SUMMARY OF INVENTION

[0020] It is an object of the invention to provide a memory formed with single-ended multi-port destructive write memory cells.

[0021] It is another object to provide a memory architecture that makes it possible to have a write-back phase from one cycle to occur simultaneously with the read-out phase of another cycle.

[0022] The foregoing and other objects of the invention are realized by a circuit and an architecture that utilize single-ended dual-port destructive write memory cells and a local write-back buffer. Each cell has separate read and write ports that make it possible to read-out data from cells along one wordline in the array, and immediately thereafter write-back data to the same cell while simultaneously cells on another wordline in the array are being read-out. Thus, by implementing an array of sense amplifiers such that one amplifier is coupled to each read bitline, and a latch receiving the result of the sensed data and delivering this data to the write bitlines, it is possible to ‘pipeline’ the read-out and write-back phases of the read cycle. This allows for a write-back phase from one cycle to occur simultaneously with the read-out phase of another cycle.

[0023] By extending the operation of the latch to accept data either from the sense amplifier, or from the memory data inputs, modified by the column address and masking bits, it is also possible to pipeline the read-out and the modify write-back phases, allowing them to occur simultaneously for different wordlines. The architecture preferably employs a nondestructive read memory cell such as 2T or 3T gain cells, achieving an SRAM-like cycle and access times with a smaller and more SER immune memory cell.

[0024] Alternatively, the architecture is provided with a destructive read memory cell such as a 2T 1C cell, which achieves the same performance with a destructive read architecture employed in other, known destructive read architectures, but with a simpler destructive read scheduling.

[0025] In a first aspect of the invention, there is provided an array of memory cells, respectively coupled to read and write bitlines, each read bit line being attached to a read sensing circuits, and each write bitlline connected to a write drivers, a latch and a multiplexer, as described above. A series of write wordlines connect the write activation ports on many cells in a manner that is orthogonal to the read and write bitlines. Similarly a series of read wordlines connects the read activation port on many cells in an orthogonal arrangement to the read and write bitlines. Read and write wordlines are asserted by a wordline decoder and driver circuits in such a manner so that one read wordline and one write wordline can be asserted simultaneously.

[0026] In a second aspect of the invention, the row decoder circuit is augmented by a latch located at each decoder output. Using this latch, the row decoder can first select a read wordline for activation. The wordline selection is stored in the latch so that the decoder is free to select the next read wordline while the last latched wordline selection is used to activate the write wordline.

[0027] The sequence of events to perform a read operation of the above array of memory cells is as follows: i) row address decoding; ii) read wordline assertion; iii) read data transfer from the selected memory cells to the read bitlines; iv) detection of the data on the read bitlines by the read sense circuits; v) transfer of the digital data from the read sense circuits both to the readout port of the memory array and to the latch associated with the read bitline; and vi)de-assertion of the read wordline.

[0028] The sequence of events to perform a write operation in the above array of memory cells is as follows: i) row address decoding; ii)read wordline assertion; iii) read data transfer from the selected memory cells to the read bitlines; iv) detection of the data on the read bitlines by the read sense circuits; v) transfer of the digital data from the read sense circuits to the latch associated with the read bitline; vi) de-assertion of the read wordline and transfer of the wordline selection to the write wordline latch; vii) logical combination of data in the read bitline latch with input signals such as column addresses, mask bit and data bits to form the appropriate state presented to the write bitline driver; viii) write wordline assertion using the wordline indicated in the wordline selection latch; ix) driving the write bitlines with the calculated write bits; x) modification of the memory cell based on the data on the cells write port; and xi) de-assertion of the write wordline and clearing of the wordline selection latches.

[0029] A “pipeline” operation of the memory array is such that some or all of the operations vii) to xi) of the write cycle operating at one wordline address may overlap, i.e., they occur concurrently with operations i) to vi) of the write cycle, or operations i) to vi) of the read cycle at a different wordline address.

[0030] The aforementioned memory array may also operate in a mode where the memory cells are read-out destructively. In this case, a read or a refresh cycle includes the same operation steps as the write cycle previously described, except for operation vii) which does not accept signals from outside the array, such that data from the read sense circuits is not modified before being delivered to the write driver. Additionally, some or all of operations vii) to xi) of the write cycle operating and one wordline address may overlap in time with operations i) to vi) of the refresh or read cycle. Also some or all of operations vii) to xi) of the refresh cycle may overlap with operations i) to vi) of the refresh cycle, and operations i) to vi) of a write cycle or operations i) to iv) of the read cycle.

BRIEF DESCRIPTION OF DRAWINGS

[0031] The accompanying drawings, which are incorporated in and which constitute a part of the specification, illustrate presently preferred embodiments of the invention and, together with the general description given above and the detailed description of the preferred embodiments given hereinafter serve to explain the principles of the invention.

[0032]FIG. 1A is a transistor level schematic showing a prior art 6T SRAM cell.

[0033]FIG. 1B is a functional representation of the prior art SRAM cell shown in FIG. 1A consisting of one storage element and two switches.

[0034]FIGS. 2A and 2B are, respectively, a transistor level schematic and a functional representation of a prior art single-ended DRAM cell.

[0035]FIG. 3A is a schematic diagram of a prior art single-ended multi-port destructive write memory cell consisting of a storage element and two switches.

[0036]FIGS. 3B and 3C respectively show typical prior art nondestructive read and destructive write memory cells consisting of 3 transistors (3T cell) and 2 transistors (2T cell).

[0037]FIG. 4 shows a single ended destructive write multi-port memory array architecture consisting of a plurality of a single ended destructive write multi-port memory cells, each including a read port, a write port and a storage element, according to the present invention.

[0038]FIG. 5 shows how the concepts of the present invention apply to a prior art 2-port 3T gain cell by enabling a non-destructive read and a destructive write operation.

[0039]FIG. 6A is a transistor level schematic of a column shown in FIG. 4, with each column consisting of a plurality of 3T memory cells coupled to a plurality of RWL and WRL pairs, and one RBL and WBL pair, according to the invention.

[0040]FIG. 6B shows the internal timing diagram for several commands, i.e., write 0 for memory cell i (W0i), write command 1 for the memory cell j (W1j), read memory cell i (Ri) and read memory cell j (Rj).

[0041]FIG. 7 shows a block diagram consisting of an address match detection circuit, a write bit register and a multiplexer.

DETAILED DESCRIPTION

[0042]FIG. 4 shows a single-ended destructive write multi-port memory array architecture, according to the present invention. The memory architecture 40 includes a plurality of single-ended destructive write multi-port memory cells 30, each consisting of a read port 31, a write port 32, and a storage element 33. The cells are organized in a matrix formation by way of a plurality of rows, each coupled to the read wordline RWL and write wordline WWL, and a plurality of columns, each coupled to read bitline RBL and to write bitline WBL. Each RWL and WWL is supported by a common row decoder 41, a read wordline driver 42, and write wordline latch and driver 43. Alternatively, each read and write wordline drivers 42 and 43 have an independent row decoder 41. Each read bitline RBL is supported by a read sense amplifier 44 and a precharged device or resistor 49, and each write bitline WBL, by bitline write circuit 45. A common column decoder 46 services a read column select driver 47 and a write column select driver 48. Alternatively, each read and write column select drivers 46 and 47 are provided with an independent column decoder 46.

[0043] The discussion following hereinafter assumes a synchronous pseudo-SRAM interface, which supports read, write, and refresh operations at each clock cycle.

[0044] Read Mode:

[0045] When a read command is given at which time the system clock CLK switches to high, the row decoders 41 and column decoders 46 enable, respectively, the corresponding read wordline driver 42 and read column select driver 47. The respective RWL thus switches to high, and opens read ports 31 of the corresponding memory cells 30. As a result, data bits in the appropriate memory cells 30 are read out to the corresponding RBLs. Concurrently, signal read column select RCSL switches to high, opening read column switch 31, allowing the corresponding read sense amplifier 45 to couple to the local read data line LRDL. Read BL precharged device 49 is preferably kept on for current sensing of the non-destructive memory cell. However, it may be turned off during RWL activation to conserve power. Alternatively, it may be turned off prior to activating RWL for a destructive read memory cell. Regardless whether a non-destructive read memory cell or a destructive read cell, a current mode sense amplifier 44 is preferably used, although the present invention is not limited to only this configuration.

[0046] Sense amplifier 44 senses the data bit generated in each RBL, which is then automatically transferred to LRDL and PRDL, allowing an SRAM-like access behavior, particularly, if a non-destructive read cell is used. For a destructive read memory cell, the destructive read operation is preferably used for creating a preconditioned write by choosing a proper BL precharged voltage, such as VDD or Â½VDD. For a write-back operation following the read-out, the sensed data bits are transferred to write buffer 45, which occurs when delayed clock CLKI switches to high. When this occurs, the write wordline latch and driver 43 activates the corresponding WWL. When a transition to high takes place, WWL opens the corresponding write ports 32 of memory cells 30. Concurrently, write bitline driver 45 drives the read data bits to WBL, allowing the read data bit to be transferred from read port 31 of memory cell 30 to be written back to write port 32 of the memory cell through RBL and WBL, and within two cycles. As long as each read and write cycle is equal or less than the clock cycle, the next read command will be accepted as a “pipeline” allowing a single cycle read access time. Alternatively, for a non-destructive read array, the write back cycle may be disabled to save power.

[0047] Write Mode:

[0048] When a write command is given at which time the system clock CLK switches to high, row decoders 41 and column decoders 46 enable the appropriate read wordline driver 42. The corresponding RWL thus switches to high, and opens read ports 31 of the corresponding memory cells 30. As a result, data bits in the matching memory cells 30 are read out to the corresponding RBLs. The read BL precharged device 49 may be kept on, preferably, for current sensing of the nondestructive memory cell. However, it may be turned off during the activation of RWL to conserve power. Alternatively, one may turn it off prior to activating RWL, i.e., in an instance of a destructive read memory cell. Regardless whether a destructive read memory cell or non-destructive read cell, current mode sense amplifier 44 senses the data bit developed on each RBL. The read-out data is transferred to the write data latch 45.

[0049] Concurrently, write data bits are driven to the primary write data line (PWDL) and local write data line. At the same time, the CLKI enables a write column select driver 48, activating the signal WCSL. The activation of the WCSL transfers the write data bit from the LWDL to the selected write bitline circuit 45, overriding the data from the read sense amplifier. When the CLKI switches to high, the write wordline driver 43 asserts the corresponding WWL. Switching WWL to high opens the corresponding write ports 32 of the memory cells 30 enabling a simultaneous write back to cells on un-selected columns and write to cells on the selected columns.

[0050] Optionally, by integrating the disable function of the WCSL with a mask bit control, the write bits end up also masked. This is well known in the art and, consequently, will not be discussed further. The foregoing allows the write data bit driven from WPDL to be written to a write port of the same memory cell, keeping the destructive write memory cell deselected by transferring the data bits form the read port of the memory cell to the write port of the same memory cell via RBL and WBL over two cycles. As long as each read or each write cycle is equal or less than the clock cycle, the next read command can be accepted as a “pipeline”, to allow a single write access cycle.

[0051] Refresh Mode:

[0052] When a refresh command is given at which time a system clock CLK switches to high, row decodes 41 enables the corresponding read wordline driver 42. The corresponding RWL switches to high, opening read ports 31 of the corresponding memory cells 30. As a result, data bits in the corresponding memory cells 30 are read out to the corresponding RBLs. The read BL precharged device 49 may preferably be kept on for current sensing of a non-destructive memory cell. However, it may be turned off during RWL activation to conserve power. Alternatively, it may be turned off prior to activating RWL for a destructive read memory cell. Regardless whether a destructive read memory cell or a non-destructive read cell are used, a current mode sense amplifier 44 may be utilized, although the invention is not limited in this configuration.

[0053] The current mode sense amplifier 44 senses the data bit generated on RBL. This makes it possible to achieve an SRAM-like access performance, particularly, if a non-destructive read cell is employed. For a destructive read memory cell, a destructive read operation is preferably used for creating a preconditioned write by selecting the appropriate BL precharged voltage, such as VDD or Â½VDD. For the write back operation following the read-out mode, the sensed data bits are transferred to the write buffer 45 when the delayed clock CLKI switches to high. When CLKI switches to high, write wordline driver 43 activates the corresponding WWL. Switching WWL to high opens the corresponding write ports 32 of the memory cells 30. Simultaneously, write bitline driver 45 drives the read data bits to WBL. This allows the read data bit that was transferred from the read port of the memory cell to be written back to the write port of the same memory cell through the RBL and the WBL within two cycles. As long as each read or write cycle is equal less than the clock cycle, the next read command can be accepted as a pipeline allowing a single cycle refresh access cycle time.

[0054] The concepts discussed above will now be more clearly understood by way of the following preferred embodiment applicable to a prior art 3T gain cell.

[0055]FIG. 5 shows a conventional 2-port 3T gain cell that enables a non-destructive read and a destructive write operation. The 3T gain cell 50 consists of three NMOS transistors 51, 52 and 53A. A data bit is maintained by gate capacitor 53A as a storage element. Optionally, an additional capacitor 53B may be integrated to improve the data retention time. The data bit at the node S can be read-out to the read bitline RBL when the read wordline RWL coupled to the gate of NMOS31 switches to high. Maintaining node S to high discharges RBL. If node S is at low, RBL remains in a high state. When write wordline WWL switches to high, the source node S becomes coupled to write bitline WBL. Thus, the data bits at the node S can be changed by WBL.

[0056]FIG. 6A is a transistor level schematic diagram of the column shown in FIG. 4. Each column 60 consists of a plurality of 3T memory cells 30 i and 30 j (FIG. 3) coupled to a plurality of RWL-WWL pairs and one RBL-WBL pair (FIG. 4). A read bitline RBL is precharged to VDD by PMOS 49. When RWL switches to high, RBL remains at high if node S is at 0, or it may discharge a few hundred millivolts from VDD. The RBL voltage level is determined by the conductance ratio between PMOS 49 and the 3T cell memory cell. A current mirror sense amplifier 44 (FIG. 4) compares RBL to a reference voltage VREF, which is set at a level half-way between VDD and the voltage to which RBL discharges to. If the voltage at RBL is higher than VREF, nodes SO and bSO are then, respectively kept at 0 and 1. If the RBL is lower than VREF, nodes SO and bSO will switch to 1 and 0, respectively. The current mirror sense amplifier 44 is enabled when signal ENABLE switches to high. A signal RCSL activates read column switch 66 to transfer the data bit from the current mirror sense amplifier 44 to the local read data line LRDL. It is preferably driven in order to transfer a signal to the primarily read data line PRDL. The signal transfer to the PRDL occurs immediately following RWL being activated. This allows an SRAM like read access performance. Bitline write circuit 61 includes a simple CMOS latch which is driven either by the current mirror sense amplifier 44 when both write column switches 62 and 63 are closed, or by the data on the LWDL line when the write column switch 64 closes. In a read or data refresh modes, write column select signal WCSL and bWCSL remain, respectively, at low and high. When the delayed clock CLKI switches to high, the node bSO at sense amplifier 44 drives CMOS latch 61, allowing the sensed result to be transferred to WBL. CLKI is preferably generated by sensing the state of cells in a dummy column in the array which is identical to a regular column in every what except the cells are pre-conditioned so that they will always store a “1” data state. The RBL in the dummy column, therefore, discharges when the RWL switches to high, which generates signal CLKI and guarantees the data bits to be ready for the write back operation to all cells on the wordline.

[0057] Two situations exist for a write cycle. For the selected columns, WCSL switches to high, allowing write BL circuit 61 to be driven by LWDL. Note that the write path from the current mirror sense amplifier 44 is disabled by bWCSL. For the unselected columns, write BL circuit 44 is driven by the corresponding current mirror sense amplifier 44, allowing the sensed data bits to be written back to the corresponding memory cells. Optionally, the data mask function may be integrated by disabling the WCSL selection. This is well known in the art, and will not be discussed further.

[0058]FIG. 6B shows the internal timing diagram for the given commands write 0 for memory cell i (W0i), write command 1 for the memory cell j (W1j), read memory cell i (Ri) and read memory cell j (Rj). It is assumed that memory cells 0 and 1 originally store 1 and 0, respectively.

[0059] The “pipelined” write followed by read architecture described above works well if the consecutive addresses are different. In view of the non-destructive read nature, two consecutive read commands (or read and write) for the same row address can also be serviced because the memory cells are forced by the same data bits read out from the same cell. For two consecutive write commands for the same row address, the BL write circuit should only be updated by the write command by forcing bWCSL while enabling WCSL. A write followed by a read for the same row address requires special data handling. Since the read-out and modify-write-back operations associated with the write cycle are pipelined, the bits written are not yet stored in the memory cell when the next read command is issued. Note that this is the only concern for the updated write bits, whereas other unselected write bits provide the same operation as a read mode. Consequently, a problem occurs only when the row and column addresses are exactly the same as for the previous write command.

[0060]FIG. 7 is a block diagram of an address match detection circuit, write bit register and multiplexer. The address match detection compares the address input with the previous address, and generates a signal REGOUT that remains at 0 if no match occurs, allowing the read data bits to be read from the memory array. If a match occurs, the signal REGOUT switches to high, allowing the data bits to be read out from the register which has been loaded by the write data of the last command.

[0061] For the destructive read and destructive write multi-port memory cell shown in FIG. 3D, VDD or Â½VDD BL precharged voltage is preferably chosen. This allows a destructive read operation to be a preconditioning write operation at VDD or Â½VDD. It also results in a faster write operation for the following write operation, further improving the write cycle time.

[0062] While the invention has been described in terms of a preferred embodiment, various alternative and modifications can be devised by those skilled in the art without departing from the invention. Accordingly, the present invention is intended to embrace all such alternatives which fall within the scope of the appended claims. 

1. A memory array comprising: a plurality of single-ended, dual-port, destructive-write memory cells arranged in an array formation of rows and columns, each cell having a read port consisting of a read data terminal and a read activation terminal, and further a write port consisting of a write data terminal and a write activation terminal, the read data terminal being accessed by asserting the cell read activation terminal and the write data terminal being accessed by asserting the cell write activation terminal, wherein in each column of cells, a read bitline connects all the read data terminals of each cell, each read bitline having a read sense amplifier connected thereto, and a write bitline with an associated write data driver connected to the write data terminal of the memory cells, and wherein when the read activation terminal is asserted, the read data terminal presents data depending upon the stored content of the memory cell, and when the write activation terminal is asserted, the content of the memory cell is set according to the state of the write data terminal; and a circuit associated with each of the read sense amplifiers and each of the write data drivers for holding data sensed by the read sensing amplifier, making data read by the read sense amplifier available to the write data driver.
 2. (cancelled)
 3. The memory array as recited in claim 1, wherein depending on electrical signals generated outside the memory array, the circuit combines data read out from the read sense amplifier and data received from outside the memory array to make it available to the write data driver.
 4. The memory array as recited in claim 1, wherein the write wordlines are coupled to the write activation terminals of the memory cells, the write wordlines being positioned orthogonally to the read and write bitlines, and wherein the plurality of read wordlines are connected to the read activation terminals of the memory cells and positioned orthogonally to the read and write bitlines.
 5. The memory array as recited in claim 3, wherein the read and write wordlines are asserted by a wordline decoder and by corresponding write data drivers, and wherein one read wordline and one write wordline are asserted simultaneously for different cells of the memory array.
 6. The memory array as recited in claim 3, wherein the circuit associated with each read sense amplifier and write data driver holds data sensed by the read sensing amplifier, providing a means for allowing data read-out of the array during one cycle to be modified and written-back to the memory array concurrently with the read-out operation of the next memory cycle.
 7. The memory array as recited in claim 6, wherein the write cycle consists of read-out and write-back phases.
 8. The memory array as recited in claim 3 wherein selected bits of the data read-out are modified by input data provided from outside the memory array before being written-back during a write cycle.
 9. The memory array as recited in claim 3 further comprises an output data terminal wherein selected bits of the read-out data are outputted to outside the memory array.
 10. The memory array as recited in claim 3, wherein circuit elements connect each read sense amplifier to a corresponding write data driver to provide means for refreshing the memory cells.
 11. The memory array as recited in claim 3, wherein a circuit is provided with each row decoder circuit to activate corresponding write wordlines during the write-back phase of a cycle occurring concurrently with the activation of a read wordline.
 12. The memory array as recited in claim 3, wherein the write wordline circuit is a latch connected to a driver
 13. The memory array as recited in claim 3, wherein the write wordline is activated by a delayed clock
 14. The memory array as recited in claim 9, wherein the columns to be read out of the memory array and written into the memory array are selected by a column decoder.
 15. The memory array as recited in claim 14, wherein the column decoder activates a column read switch which connects selected read sense amplifiers to the output terminal of the memory array.
 16. The memory array as recited in claim 14, wherein the column decoder activates a column write switch connecting the memory input port to the write data drivers allowing the selected write data to override data from the read sense amplifier.
 17. The memory array as recited in claim 16, wherein activation of the column write switch is triggered by the delayed clock.
 18. The memory array as recited in claim 1, wherein the memory cells are read-out non-destructively.
 19. The memory array as recited in claim 1, wherein the memory cells are read-out destructively.
 20. The memory array as recited in claim 3, further comprising an address match detection means for detecting when a write cycle is immediately followed by a read cycle at the same address, wherein during a pipelined operation, the read cycle overlaps with a previous write cycle, and wherein when addresses of two operations coincide, input data associated with a write operation and modified by logic operations associated with the latch and the multiplexer is delivered to the memory array output data port.
 21. The memory array as recited in claim 3, wherein the sense amplifiers are single-ended, current mode sense amplifier.
 22. The memory array as recited in claim 3, wherein the sense amplifiers are differential amplifiers.
 23. The memory array as recited in claim 3, wherein a pipeline operation is controlled by at least one clock.
 24. The memory array as recited in claim 3, wherein the memory cells are selected from the group of cells consisting of 3-transistors and 1-capacitor (3T 1C); 2-transistors and 1-capacitor (2T 1C); and 1-transistor and 1-capacitor (1T 1C). 